Overview
Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 627920 |
| Missing cells | 3008 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 302.6 MiB |
| Average record size in memory | 505.3 B |
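The missing-cell percentage and the average record size follow from the raw counts in this table. A minimal sketch of that arithmetic, assuming that a "cell" is one variable of one observation and that the memory figure uses binary units (1 MiB = 1,048,576 bytes). The variable names are illustrative and not part of the report.

```python
# Figures copied from the Dataset statistics table.
n_variables = 14
n_observations = 627_920
missing_cells = 3_008
total_size_mib = 302.6

# Assumption: every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the missing share comes out well below 0.1%, consistent with the "< 0.1%" entry, and the record size rounds to the reported 505.3 B.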
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 3 |
| Text | 3 |
| DateTime | 1 |
Alerts
| Alert | Type |
|---|---|
| Country_Region has constant value "US" | Constant |
| Confirmed is highly overall correlated with Deaths | High correlation |
| Deaths is highly overall correlated with Confirmed | High correlation |
| FIPS is highly overall correlated with UID | High correlation |
| Lat is highly overall correlated with iso2 and 1 other fields | High correlation |
| UID is highly overall correlated with FIPS and 2 other fields | High correlation |
| code3 is highly overall correlated with iso2 and 1 other fields | High correlation |
| iso2 is highly overall correlated with Lat and 3 other fields | High correlation |
| iso3 is highly overall correlated with Lat and 3 other fields | High correlation |
| iso2 is highly imbalanced (93.1%) | Imbalance |
| iso3 is highly imbalanced (93.1%) | Imbalance |
| Confirmed is highly skewed (γ1 = 39.35689783) | Skewed |
| Deaths is highly skewed (γ1 = 63.59088753) | Skewed |
| Lat has 20304 (3.2%) zeros | Zeros |
| Long_ has 20304 (3.2%) zeros | Zeros |
| Confirmed has 253223 (40.3%) zeros | Zeros |
| Deaths has 428930 (68.3%) zeros | Zeros |
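The 93.1% imbalance reported for iso2 can be reproduced as one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition: it matches the reported figure, but the report itself does not state the formula. `imbalance_score` is a hypothetical helper name, and the counts are the six iso2 class counts listed in the iso2 section of this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares.

    0 means perfectly balanced classes; values near 1 mean one class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        # Assumed convention: a single-class column is not scored as imbalanced.
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)


# iso2 class counts: US, PR, AS, GU, MP, VI
iso2_counts = [612128, 15040, 188, 188, 188, 188]
score = imbalance_score(iso2_counts)
print(f"iso2 imbalance: {100 * score:.1f}%")
```

Because iso3 has the same six classes with the same counts, the same score applies to it.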
Reproduction
| Analysis started | 2025-11-30 19:08:19.748512 |
|---|---|
| Analysis finished | 2025-11-30 19:08:40.093828 |
| Duration | 20.35 seconds |
| Software version | ydata-profiling v4.17.0 |
Variables
UID
Real number (ℝ)
High correlation
| Distinct | 3340 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83429580 |
| Minimum | 16 |
| Maximum | 84099999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.8 MiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 84002170 |
| Q1 | 84018108 |
| median | 84029208 |
| Q3 | 84046120 |
| 95-th percentile | 84055081 |
| Maximum | 84099999 |
| Range | 84099983 |
| Interquartile range (IQR) | 28011 |
Descriptive statistics
| Standard deviation | 4314702.3 |
|---|---|
| Coefficient of variation (CV) | 0.051716697 |
| Kurtosis | 176.2819 |
| Mean | 83429580 |
| Median Absolute Deviation (MAD) | 12834 |
| Skewness | -11.171049 |
| Sum | 5.2387102 × 10^13 |
| Variance | 1.8616656 × 10^13 |
| Monotonicity | Not monotonic |
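The descriptive statistics for UID are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the number of non-missing observations. A minimal sketch that recomputes them from the rounded values shown in the report, so agreement is only expected to within rounding. The variable names are illustrative.

```python
import math

# Values copied from the UID tables (UID has no missing values).
n_observations = 627_920
mean = 83_429_580
std_dev = 4_314_702.3

computed = {
    "cv": std_dev / mean,            # coefficient of variation
    "variance": std_dev ** 2,        # variance = std_dev squared
    "sum": mean * n_observations,    # sum = mean * count
}

# Reported values, as printed in the report (about 8 significant digits).
reported = {"cv": 0.051716697, "variance": 1.8616656e13, "sum": 5.2387102e13}

for key in reported:
    matches = math.isclose(computed[key], reported[key], rel_tol=1e-6)
    print(f"{key}: computed={computed[key]:.8g} reported={reported[key]:.8g} match={matches}")
```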
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 188 | < 0.1% |
| 84041013 | 188 | < 0.1% |
| 84040015 | 188 | < 0.1% |
| 84040017 | 188 | < 0.1% |
| 84040019 | 188 | < 0.1% |
| 84040021 | 188 | < 0.1% |
| 84040023 | 188 | < 0.1% |
| 84040025 | 188 | < 0.1% |
| 84040027 | 188 | < 0.1% |
| 84040029 | 188 | < 0.1% |
| Other values (3330) | 626040 | 99.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 188 | < 0.1% |
| 316 | 188 | < 0.1% |
| 580 | 188 | < 0.1% |
| 850 | 188 | < 0.1% |
| 63072001 | 188 | < 0.1% |
| 63072003 | 188 | < 0.1% |
| 63072005 | 188 | < 0.1% |
| 63072007 | 188 | < 0.1% |
| 63072009 | 188 | < 0.1% |
| 63072011 | 188 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 84099999 | 188 | < 0.1% |
| 84090056 | 188 | < 0.1% |
| 84090055 | 188 | < 0.1% |
| 84090054 | 188 | < 0.1% |
| 84090053 | 188 | < 0.1% |
| 84090051 | 188 | < 0.1% |
| 84090050 | 188 | < 0.1% |
| 84090049 | 188 | < 0.1% |
| 84090048 | 188 | < 0.1% |
| 84090047 | 188 | < 0.1% |
iso2
Categorical
High correlation Imbalance
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.3 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AS |
|---|---|
| 2nd row | GU |
| 3rd row | MP |
| 4th row | PR |
| 5th row | PR |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| US | 612128 | 97.5% |
| PR | 15040 | 2.4% |
| AS | 188 | < 0.1% |
| GU | 188 | < 0.1% |
| MP | 188 | < 0.1% |
| VI | 188 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| us | 612128 | 97.5% |
| pr | 15040 | 2.4% |
| as | 188 | < 0.1% |
| gu | 188 | < 0.1% |
| mp | 188 | < 0.1% |
| vi | 188 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| U | 612316 | 48.8% |
| S | 612316 | 48.8% |
| P | 15228 | 1.2% |
| R | 15040 | 1.2% |
| A | 188 | < 0.1% |
| G | 188 | < 0.1% |
| M | 188 | < 0.1% |
| V | 188 | < 0.1% |
| I | 188 | < 0.1% |
iso3
Categorical
High correlation Imbalance
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.9 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ASM |
|---|---|
| 2nd row | GUM |
| 3rd row | MNP |
| 4th row | PRI |
| 5th row | PRI |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| USA | 612128 | 97.5% |
| PRI | 15040 | 2.4% |
| ASM | 188 | < 0.1% |
| GUM | 188 | < 0.1% |
| MNP | 188 | < 0.1% |
| VIR | 188 | < 0.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| usa | 612128 | 97.5% |
| pri | 15040 | 2.4% |
| asm | 188 | < 0.1% |
| gum | 188 | < 0.1% |
| mnp | 188 | < 0.1% |
| vir | 188 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| U | 612316 | 32.5% |
| S | 612316 | 32.5% |
| A | 612316 | 32.5% |
| P | 15228 | 0.8% |
| R | 15228 | 0.8% |
| I | 15228 | 0.8% |
| M | 564 | < 0.1% |
| G | 188 | < 0.1% |
| N | 188 | < 0.1% |
| V | 188 | < 0.1% |
code3
Real number (ℝ)
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 834.49162 |
| Minimum | 16 |
| Maximum | 850 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.8 MiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 840 |
| Q1 | 840 |
| median | 840 |
| Q3 | 840 |
| 95-th percentile | 840 |
| Maximum | 850 |
| Range | 834 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 36.49262 |
|---|---|
| Coefficient of variation (CV) | 0.043730362 |
| Kurtosis | 109.29686 |
| Mean | 834.49162 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -8.5497067 |
| Sum | 5.2399398 × 10^8 |
| Variance | 1331.7113 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 840 | 612128 | 97.5% |
| 630 | 15040 | 2.4% |
| 16 | 188 | < 0.1% |
| 316 | 188 | < 0.1% |
| 580 | 188 | < 0.1% |
| 850 | 188 | < 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 188 | < 0.1% |
| 316 | 188 | < 0.1% |
| 580 | 188 | < 0.1% |
| 630 | 15040 | 2.4% |
| 840 | 612128 | 97.5% |
| 850 | 188 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 850 | 188 | < 0.1% |
| 840 | 612128 | 97.5% |
| 630 | 15040 | 2.4% |
| 580 | 188 | < 0.1% |
| 316 | 188 | < 0.1% |
| 16 | 188 | < 0.1% |
FIPS
Real number (ℝ)
High correlation
| Distinct | 3330 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 1880 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33061.685 |
| Minimum | 60 |
| Maximum | 99999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.8 MiB |
Quantile statistics
| Minimum | 60 |
|---|---|
| 5-th percentile | 5103 |
| Q1 | 19079 |
| median | 31014 |
| Q3 | 47131 |
| 95-th percentile | 72035 |
| Maximum | 99999 |
| Range | 99939 |
| Interquartile range (IQR) | 28052 |
Descriptive statistics
| Standard deviation | 18636.157 |
|---|---|
| Coefficient of variation (CV) | 0.56367838 |
| Kurtosis | 0.39039103 |
| Mean | 33061.685 |
| Median Absolute Deviation (MAD) | 13902 |
| Skewness | 0.60632612 |
| Sum | 2.0697937 × 10^10 |
| Variance | 3.4730634 × 10^8 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 46109 | 188 | < 0.1% |
| 40003 | 188 | < 0.1% |
| 40005 | 188 | < 0.1% |
| 40007 | 188 | < 0.1% |
| 40009 | 188 | < 0.1% |
| 40011 | 188 | < 0.1% |
| 40013 | 188 | < 0.1% |
| 40015 | 188 | < 0.1% |
| 40017 | 188 | < 0.1% |
| 40019 | 188 | < 0.1% |
| Other values (3320) | 624160 | 99.4% |
| (Missing) | 1880 | 0.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 60 | 188 | < 0.1% |
| 66 | 188 | < 0.1% |
| 69 | 188 | < 0.1% |
| 78 | 188 | < 0.1% |
| 1001 | 188 | < 0.1% |
| 1003 | 188 | < 0.1% |
| 1005 | 188 | < 0.1% |
| 1007 | 188 | < 0.1% |
| 1009 | 188 | < 0.1% |
| 1011 | 188 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 99999 | 188 | < 0.1% |
| 90056 | 188 | < 0.1% |
| 90055 | 188 | < 0.1% |
| 90054 | 188 | < 0.1% |
| 90053 | 188 | < 0.1% |
| 90051 | 188 | < 0.1% |
| 90050 | 188 | < 0.1% |
| 90049 | 188 | < 0.1% |
| 90048 | 188 | < 0.1% |
| 90047 | 188 | < 0.1% |
Admin2
Text
| Distinct | 1978 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1128 |
| Missing (%) | 0.2% |
| Memory size | 38.4 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 21 |
| Mean length | 7.1541692 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Adjuntas |
|---|---|
| 2nd row | Aguada |
| 3rd row | Aguadilla |
| 4th row | Aguas Buenas |
| 5th row | Aibonito |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| of | 10716 | 1.5% |
| unassigned | 9776 | 1.4% |
| out | 9776 | 1.4% |
| washington | 5828 | 0.8% |
| jefferson | 5264 | 0.8% |
| st | 4888 | 0.7% |
| franklin | 4888 | 0.7% |
| jackson | 4512 | 0.7% |
| lincoln | 4512 | 0.7% |
| san | 3948 | 0.6% |
| Other values (1999) | 629800 | 90.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 443304 | 9.9% |
| e | 421120 | 9.4% |
| n | 376940 | 8.4% |
| o | 341408 | 7.6% |
| r | 292904 | 6.5% |
| l | 244212 | 5.4% |
| i | 235940 | 5.3% |
| s | 208492 | 4.6% |
| t | 203980 | 4.5% |
| u | 119944 | 2.7% |
| Other values (48) | 1595932 | 35.6% |
Province_State
Text
| Distinct | 58 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 39.0 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 16 |
| Mean length | 8.1739521 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | American Samoa |
|---|---|
| 2nd row | Guam |
| 3rd row | Northern Mariana Islands |
| 4th row | Puerto Rico |
| 5th row | Puerto Rico |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| texas | 48128 | 6.6% |
| virginia | 36096 | 4.9% |
| georgia | 30268 | 4.1% |
| north | 29516 | 4.0% |
| carolina | 28200 | 3.8% |
| new | 25192 | 3.4% |
| dakota | 23124 | 3.2% |
| kentucky | 22936 | 3.1% |
| missouri | 22184 | 3.0% |
| south | 21808 | 3.0% |
| Other values (57) | 446312 | 60.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 674168 | 13.1% |
| i | 548960 | 10.7% |
| o | 428828 | 8.4% |
| n | 423940 | 8.3% |
| s | 415668 | 8.1% |
| e | 316404 | 6.2% |
| r | 270156 | 5.3% |
| t | 179916 | 3.5% |
| l | 158672 | 3.1% |
| h | 129720 | 2.5% |
| Other values (36) | 1586156 | 30.9% |
Country_Region
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 35.3 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| US | 627920 | 100.0% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| us | 627920 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| U | 627920 | 50.0% |
| S | 627920 | 50.0% |
Lat
Real number (ℝ)
High correlation Zeros
| Distinct | 3226 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.707212 |
| Minimum | -14.271 |
| Maximum | 69.314792 |
| Zeros | 20304 |
| Zeros (%) | 3.2% |
| Negative | 188 |
| Negative (%) | < 0.1% |
| Memory size | 4.8 MiB |
Quantile statistics
| Minimum | -14.271 |
|---|---|
| 5-th percentile | 18.344964 |
| Q1 | 33.895587 |
| median | 38.002344 |
| Q3 | 41.573069 |
| 95-th percentile | 46.466812 |
| Maximum | 69.314792 |
| Range | 83.585792 |
| Interquartile range (IQR) | 7.6774818 |
Descriptive statistics
| Standard deviation | 9.0615719 |
|---|---|
| Coefficient of variation (CV) | 0.2468608 |
| Kurtosis | 7.137463 |
| Mean | 36.707212 |
| Median Absolute Deviation (MAD) | 3.8387588 |
| Skewness | -2.1144867 |
| Sum | 23049193 |
| Variance | 82.112085 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 20304 | 3.2% |
| 40.12491499 | 376 | 0.1% |
| 39.37231946 | 376 | 0.1% |
| 37.85447192 | 376 | 0.1% |
| 38.99617072 | 376 | 0.1% |
| 41.52106798 | 376 | 0.1% |
| 41.27116049 | 376 | 0.1% |
| 41.40674725 | 376 | 0.1% |
| 39.96995815 | 188 | < 0.1% |
| 39.56021306 | 188 | < 0.1% |
| Other values (3216) | 604608 | 96.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -14.271 | 188 | < 0.1% |
| 0 | 20304 | 3.2% |
| 13.4443 | 188 | < 0.1% |
| 15.0979 | 188 | < 0.1% |
| 17.982429 | 188 | < 0.1% |
| 17.994525 | 188 | < 0.1% |
| 17.998457 | 188 | < 0.1% |
| 18.007516 | 188 | < 0.1% |
| 18.010387 | 188 | < 0.1% |
| 18.011661 | 188 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 69.31479216 | 188 | < 0.1% |
| 67.04919196 | 188 | < 0.1% |
| 65.50815459 | 188 | < 0.1% |
| 64.90320724 | 188 | < 0.1% |
| 64.80726247 | 188 | < 0.1% |
| 63.87692095 | 188 | < 0.1% |
| 63.67264044 | 188 | < 0.1% |
| 62.31305045 | 188 | < 0.1% |
| 62.1542916 | 188 | < 0.1% |
| 61.47502768 | 188 | < 0.1% |
Long_
Real number (ℝ)
Zeros
| Distinct | 3226 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -88.601474 |
| Minimum | -174.1596 |
| Maximum | 145.6739 |
| Zeros | 20304 |
| Zeros (%) | 3.2% |
| Negative | 607240 |
| Negative (%) | 96.7% |
| Memory size | 4.8 MiB |
Quantile statistics
| Minimum | -174.1596 |
|---|---|
| 5-th percentile | -117.54927 |
| Q1 | -97.790204 |
| median | -89.48671 |
| Q3 | -82.311265 |
| 95-th percentile | -66.789985 |
| Maximum | 145.6739 |
| Range | 319.8335 |
| Interquartile range (IQR) | 15.478939 |
Descriptive statistics
| Standard deviation | 21.715747 |
|---|---|
| Coefficient of variation (CV) | -0.24509465 |
| Kurtosis | 15.177245 |
| Mean | -88.601474 |
| Median Absolute Deviation (MAD) | 7.7586698 |
| Skewness | 2.493298 |
| Sum | -55634638 |
| Variance | 471.57368 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 20304 | 3.2% |
| -109.5174415 | 376 | 0.1% |
| -111.5758676 | 376 | 0.1% |
| -111.4418764 | 376 | 0.1% |
| -110.7013958 | 376 | 0.1% |
| -113.0832816 | 376 | 0.1% |
| -111.9145117 | 376 | 0.1% |
| -70.68763497 | 376 | 0.1% |
| -83.01115755 | 188 | < 0.1% |
| -83.4562016 | 188 | < 0.1% |
| Other values (3216) | 604608 | 96.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -174.1596 | 188 | < 0.1% |
| -170.132 | 188 | < 0.1% |
| -164.0353804 | 188 | < 0.1% |
| -163.3967883 | 188 | < 0.1% |
| -161.9722021 | 188 | < 0.1% |
| -159.8561831 | 188 | < 0.1% |
| -159.7503946 | 188 | < 0.1% |
| -159.5966786 | 188 | < 0.1% |
| -158.2381942 | 188 | < 0.1% |
| -157.9712182 | 188 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 145.6739 | 188 | < 0.1% |
| 144.7937 | 188 | < 0.1% |
| 0 | 20304 | 3.2% |
| -64.8963 | 188 | < 0.1% |
| -65.28813 | 188 | < 0.1% |
| -65.440971 | 188 | < 0.1% |
| -65.666416 | 188 | < 0.1% |
| -65.666866 | 188 | < 0.1% |
| -65.725097 | 188 | < 0.1% |
| -65.753897 | 188 | < 0.1% |
Combined_Key
Text
| Distinct | 3340 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 46.9 MiB |
Length
| Max length | 55 |
|---|---|
| Median length | 35 |
| Mean length | 21.300599 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | American Samoa, US |
|---|---|
| 2nd row | Guam, US |
| 3rd row | Northern Mariana Islands, US |
| 4th row | Adjuntas, Puerto Rico, US |
| 5th row | Aguada, Puerto Rico, US |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| us | 623972 | 30.5% |
| texas | 48504 | 2.4% |
| virginia | 36096 | 1.8% |
| georgia | 30080 | 1.5% |
| north | 29704 | 1.5% |
| carolina | 28200 | 1.4% |
| new | 26508 | 1.3% |
| dakota | 23500 | 1.1% |
| kentucky | 22936 | 1.1% |
| south | 21808 | 1.1% |
| Other values (2049) | 1156576 | 56.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 1419964 | 10.6% |
| , | 1254712 | 9.4% |
| a | 1117472 | 8.4% |
| n | 800880 | 6.0% |
| i | 785088 | 5.9% |
| o | 770236 | 5.8% |
| e | 737524 | 5.5% |
| S | 705752 | 5.3% |
| U | 651044 | 4.9% |
| s | 624160 | 4.7% |
| Other values (49) | 4508240 | 33.7% |
Date
Date
| Distinct | 188 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.8 MiB |
| Minimum | 2020-01-22 00:00:00 |
| Maximum | 2020-07-27 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
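The Date figures can be cross-checked against the rest of the report: the span from the minimum to the maximum date covers exactly as many calendar days as there are distinct dates, and multiplying that by the number of distinct UID values gives the observation count. A minimal sketch of that check. Reading this as "one row per location per day" is an interpretation of the numbers, not something the report states.

```python
from datetime import date

# Values copied from the Date, UID and Dataset statistics tables.
first_day = date(2020, 1, 22)
last_day = date(2020, 7, 27)
distinct_dates = 188
distinct_uids = 3340
n_observations = 627_920

# Inclusive number of calendar days between the minimum and maximum date.
calendar_days = (last_day - first_day).days + 1

print(f"calendar days covered: {calendar_days}")
print(f"no gaps in the date range: {calendar_days == distinct_dates}")
print(f"rows = locations x days: {distinct_uids * distinct_dates == n_observations}")
```

This is also consistent with every UID appearing with a count of 188 in the UID frequency tables.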
Confirmed
Real number (ℝ)
High correlation Skewed Zeros
| Distinct | 11091 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 357.28428 |
| Minimum | 0 |
| Maximum | 224051 |
| Zeros | 253223 |
| Zeros (%) | 40.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 63 |
| 95-th percentile | 987 |
| Maximum | 224051 |
| Range | 224051 |
| Interquartile range (IQR) | 63 |
Descriptive statistics
| Standard deviation | 3487.2827 |
|---|---|
| Coefficient of variation (CV) | 9.7605264 |
| Kurtosis | 2053.3089 |
| Mean | 357.28428 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 39.356898 |
| Sum | 2.2434595 × 10^8 |
| Variance | 12161141 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 253223 | 40.3% |
| 1 | 24942 | 4.0% |
| 2 | 15883 | 2.5% |
| 3 | 13001 | 2.1% |
| 4 | 10569 | 1.7% |
| 5 | 9394 | 1.5% |
| 6 | 8403 | 1.3% |
| 7 | 6962 | 1.1% |
| 8 | 6341 | 1.0% |
| 9 | 5542 | 0.9% |
| Other values (11081) | 273660 | 43.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 253223 | 40.3% |
| 1 | 24942 | 4.0% |
| 2 | 15883 | 2.5% |
| 3 | 13001 | 2.1% |
| 4 | 10569 | 1.7% |
| 5 | 9394 | 1.5% |
| 6 | 8403 | 1.3% |
| 7 | 6962 | 1.1% |
| 8 | 6341 | 1.0% |
| 9 | 5542 | 0.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 224051 | 1 | < 0.1% |
| 223761 | 1 | < 0.1% |
| 223532 | 1 | < 0.1% |
| 223192 | 1 | < 0.1% |
| 222832 | 1 | < 0.1% |
| 222444 | 1 | < 0.1% |
| 222094 | 1 | < 0.1% |
| 221703 | 1 | < 0.1% |
| 221419 | 1 | < 0.1% |
| 221121 | 1 | < 0.1% |
Deaths
Real number (ℝ)
High correlation Skewed Zeros
| Distinct | 2011 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.536328 |
| Minimum | 0 |
|---|---|
| Maximum | 23500 |
| Zeros | 428930 |
| Zeros (%) | 68.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 34 |
| Maximum | 23500 |
| Range | 23500 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 300.99147 |
|---|---|
| Coefficient of variation (CV) | 17.163882 |
| Kurtosis | 4589.5806 |
| Mean | 17.536328 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 63.590888 |
| Sum | 11011411 |
| Variance | 90595.862 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 428930 | 68.3% |
| 1 | 50238 | 8.0% |
| 2 | 24941 | 4.0% |
| 3 | 15125 | 2.4% |
| 4 | 10637 | 1.7% |
| 5 | 7450 | 1.2% |
| 6 | 6652 | 1.1% |
| 7 | 5225 | 0.8% |
| 8 | 4606 | 0.7% |
| 9 | 3912 | 0.6% |
| Other values (2001) | 70204 | 11.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 428930 | 68.3% |
| 1 | 50238 | 8.0% |
| 2 | 24941 | 4.0% |
| 3 | 15125 | 2.4% |
| 4 | 10637 | 1.7% |
| 5 | 7450 | 1.2% |
| 6 | 6652 | 1.1% |
| 7 | 5225 | 0.8% |
| 8 | 4606 | 0.7% |
| 9 | 3912 | 0.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 23500 | 1 | < 0.1% |
| 23485 | 1 | < 0.1% |
| 23476 | 1 | < 0.1% |
| 23465 | 1 | < 0.1% |
| 23463 | 1 | < 0.1% |
| 23428 | 1 | < 0.1% |
| 23424 | 1 | < 0.1% |
| 23411 | 1 | < 0.1% |
| 23400 | 1 | < 0.1% |
| 23388 | 1 | < 0.1% |
Interactions
Correlations
| | Confirmed | Deaths | FIPS | Lat | Long_ | UID | code3 | iso2 | iso3 |
|---|---|---|---|---|---|---|---|---|---|
| Confirmed | 1.000 | 0.778 | -0.095 | -0.036 | 0.106 | -0.071 | 0.042 | 0.000 | 0.000 |
| Deaths | 0.778 | 1.000 | -0.117 | -0.060 | 0.137 | -0.062 | 0.102 | 0.000 | 0.000 |
| FIPS | -0.095 | -0.117 | 1.000 | -0.068 | 0.202 | 0.868 | -0.235 | 0.426 | 0.426 |
| Lat | -0.036 | -0.060 | -0.068 | 1.000 | -0.291 | 0.063 | 0.248 | 0.629 | 0.629 |
| Long_ | 0.106 | 0.137 | 0.202 | -0.291 | 1.000 | 0.068 | -0.242 | 0.493 | 0.493 |
| UID | -0.071 | -0.062 | 0.868 | 0.063 | 0.068 | 1.000 | 0.265 | 1.000 | 1.000 |
| code3 | 0.042 | 0.102 | -0.235 | 0.248 | -0.242 | 0.265 | 1.000 | 1.000 | 1.000 |
| iso2 | 0.000 | 0.000 | 0.426 | 0.629 | 0.493 | 1.000 | 1.000 | 1.000 | 1.000 |
| iso3 | 0.000 | 0.000 | 0.426 | 0.629 | 0.493 | 1.000 | 1.000 | 1.000 | 1.000 |
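The "High correlation" alerts in the overview can be reproduced from this matrix by flagging every off-diagonal pair whose absolute coefficient reaches a cutoff. The sketch below assumes a cutoff of 0.5. That value is an assumption chosen because it reproduces the alerts listed in this report (for example, Long_ peaks at 0.493 and has no alert); the report itself does not state the threshold. `correlated_fields` is a hypothetical helper name.

```python
names = ["Confirmed", "Deaths", "FIPS", "Lat", "Long_", "UID", "code3", "iso2", "iso3"]

# Correlation matrix copied row by row from the table above.
matrix = [
    [1.000, 0.778, -0.095, -0.036, 0.106, -0.071, 0.042, 0.000, 0.000],
    [0.778, 1.000, -0.117, -0.060, 0.137, -0.062, 0.102, 0.000, 0.000],
    [-0.095, -0.117, 1.000, -0.068, 0.202, 0.868, -0.235, 0.426, 0.426],
    [-0.036, -0.060, -0.068, 1.000, -0.291, 0.063, 0.248, 0.629, 0.629],
    [0.106, 0.137, 0.202, -0.291, 1.000, 0.068, -0.242, 0.493, 0.493],
    [-0.071, -0.062, 0.868, 0.063, 0.068, 1.000, 0.265, 1.000, 1.000],
    [0.042, 0.102, -0.235, 0.248, -0.242, 0.265, 1.000, 1.000, 1.000],
    [0.000, 0.000, 0.426, 0.629, 0.493, 1.000, 1.000, 1.000, 1.000],
    [0.000, 0.000, 0.426, 0.629, 0.493, 1.000, 1.000, 1.000, 1.000],
]

THRESHOLD = 0.5  # assumed cutoff, not stated in the report


def correlated_fields(names, matrix, threshold):
    """Map each variable to the other variables with |coefficient| >= threshold."""
    return {
        name: [
            other
            for j, other in enumerate(names)
            if i != j and abs(matrix[i][j]) >= threshold
        ]
        for i, name in enumerate(names)
    }


correlated = correlated_fields(names, matrix, THRESHOLD)
for name, others in correlated.items():
    if others:
        print(f"{name} is highly correlated with: {', '.join(others)}")
```

With this cutoff, iso2 and iso3 are each flagged against four other fields and Long_ against none, matching the alert list.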
Missing values
Sample
| | UID | iso2 | iso3 | code3 | FIPS | Admin2 | Province_State | Country_Region | Lat | Long_ | Combined_Key | Date | Confirmed | Deaths |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 16 | AS | ASM | 16 | 60.0 | NaN | American Samoa | US | -14.271000 | -170.132000 | American Samoa, US | 1/22/20 | 0 | 0 |
| 1 | 316 | GU | GUM | 316 | 66.0 | NaN | Guam | US | 13.444300 | 144.793700 | Guam, US | 1/22/20 | 0 | 0 |
| 2 | 580 | MP | MNP | 580 | 69.0 | NaN | Northern Mariana Islands | US | 15.097900 | 145.673900 | Northern Mariana Islands, US | 1/22/20 | 0 | 0 |
| 3 | 63072001 | PR | PRI | 630 | 72001.0 | Adjuntas | Puerto Rico | US | 18.180117 | -66.754367 | Adjuntas, Puerto Rico, US | 1/22/20 | 0 | 0 |
| 4 | 63072003 | PR | PRI | 630 | 72003.0 | Aguada | Puerto Rico | US | 18.360255 | -67.175131 | Aguada, Puerto Rico, US | 1/22/20 | 0 | 0 |
| 5 | 63072005 | PR | PRI | 630 | 72005.0 | Aguadilla | Puerto Rico | US | 18.459681 | -67.120815 | Aguadilla, Puerto Rico, US | 1/22/20 | 0 | 0 |
| 6 | 63072007 | PR | PRI | 630 | 72007.0 | Aguas Buenas | Puerto Rico | US | 18.251619 | -66.126806 | Aguas Buenas, Puerto Rico, US | 1/22/20 | 0 | 0 |
| 7 | 63072009 | PR | PRI | 630 | 72009.0 | Aibonito | Puerto Rico | US | 18.131361 | -66.264131 | Aibonito, Puerto Rico, US | 1/22/20 | 0 | 0 |
| 8 | 63072011 | PR | PRI | 630 | 72011.0 | Anasco | Puerto Rico | US | 18.287985 | -67.120611 | Anasco, Puerto Rico, US | 1/22/20 | 0 | 0 |
| 9 | 63072013 | PR | PRI | 630 | 72013.0 | Arecibo | Puerto Rico | US | 18.406631 | -66.675077 | Arecibo, Puerto Rico, US | 1/22/20 | 0 | 0 |
| | UID | iso2 | iso3 | code3 | FIPS | Admin2 | Province_State | Country_Region | Lat | Long_ | Combined_Key | Date | Confirmed | Deaths |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 627910 | 84070002 | US | USA | 840 | NaN | Dukes and Nantucket | Massachusetts | US | 41.406747 | -70.687635 | Dukes and Nantucket,Massachusetts,US | 7/27/20 | 95 | 24 |
| 627911 | 84070003 | US | USA | 840 | NaN | Kansas City | Missouri | US | 39.099700 | -94.578600 | Kansas City,Missouri,US | 7/27/20 | 4949 | 3 |
| 627912 | 84070004 | US | USA | 840 | NaN | Michigan Department of Corrections (MDOC) | Michigan | US | 0.000000 | 0.000000 | Michigan Department of Corrections (MDOC), Michigan, US | 7/27/20 | 4124 | 68 |
| 627913 | 84070005 | US | USA | 840 | NaN | Federal Correctional Institution (FCI) | Michigan | US | 0.000000 | 0.000000 | Federal Correctional Institution (FCI), Michigan, US | 7/27/20 | 192 | 5 |
| 627914 | 84070015 | US | USA | 840 | NaN | Bear River | Utah | US | 41.521068 | -113.083282 | Bear River, Utah, US | 7/27/20 | 2099 | 5 |
| 627915 | 84070016 | US | USA | 840 | NaN | Central Utah | Utah | US | 39.372319 | -111.575868 | Central Utah, Utah, US | 7/27/20 | 347 | 1 |
| 627916 | 84070017 | US | USA | 840 | NaN | Southeast Utah | Utah | US | 38.996171 | -110.701396 | Southeast Utah, Utah, US | 7/27/20 | 70 | 0 |
| 627917 | 84070018 | US | USA | 840 | NaN | Southwest Utah | Utah | US | 37.854472 | -111.441876 | Southwest Utah, Utah, US | 7/27/20 | 2781 | 23 |
| 627918 | 84070019 | US | USA | 840 | NaN | TriCounty | Utah | US | 40.124915 | -109.517442 | TriCounty, Utah, US | 7/27/20 | 142 | 0 |
| 627919 | 84070020 | US | USA | 840 | NaN | Weber-Morgan | Utah | US | 41.271160 | -111.914512 | Weber-Morgan, Utah, US | 7/27/20 | 2375 | 24 |